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■^r Abstract. The tight upper bound on the state complexity of the reverse 

of 7?.-trivial and ^-trivial regular languages is 2"~^ for languages of the 
state complexity n. The witness for 7i-trivial regular languages is over a 
^ ternary alphabet while for ^-trivial regular languages over an (n — 1)- 

I_J element alphabet. In this paper, we prove that the size of the alphabets 

^j, cannot be improved, that is, the bound can be met neither by a binary 

• 7?.-trivial regular language nor by a ^-trivial regular language over an 

fj {n — 2)-element alphabet. We also present tight bounds for binary 7?.- 

I I trivial and (n — 2)-element ,y-trivial regular languages. Tight bounds for 

(n — fe)-element ^7-trivial regular languages, for 2 < fc < n — 3, are open. 

> 

^^ 1 Introduction 

l> 

^D Regular languages of simple forms play an important role in mathematics and 

T^lj- computer science. The reader is referred to, e.g., |1I7I15) for a few applications 

C^ of jT'-trivial or piecewise testable languages. 

f^ The state complexity of a regular language is the number of states of a mini- 

. . mal deterministic finite automaton (DFA) accepting the language. Equivalently, 

J> the state complexity coincides with the quotient complexity [J, which is the 

number of distinct (left) quotients of a regular language. 

The reverse of a machine or of a language is one of the classical operations 
Cd in automata and formal language theory. However, in comparison with, e.g., 

boolean operations, the state complexity of the reverse of regular languages is 
exponential in the worst case, and there exist binary witness languages of the 
state complexity n with the reverse of the state complexity 2", see |14l20j . This 
even holds true for union-free regular languages defined by regular expressions 
without the union operation [S]. 

In this paper, we consider languages defined by Green's equivalence relations, 
namely 7?,-trivial and J^-trivial regular languages. Let Af be a monoid and s and 



* Research supported by VEGA grant 2/0183/11 and by grant APVV-0035-10. 
** Research supported by GACR grant P202/11/P028 and by RVO: 67985840. 



2 G. Jiraskova, T. Masopust 

t be two elements of M. Green's relations £, TZ, J ^ and "H on M are defined 
as follows: {s^t) G £ if and only if M ■ s — M ■ t, (s,t) E TZ ii and only if 
s- M = t- M, {s,t) € J a and only if M • s • Af = M • t • M, and H = £ n 7^. 
For p G {C,TZ,J','H}, M is p-trivial if (s,t) G p implies s = t, for all s,i in 
Af . A language is p-trivial if its syntactic monoid is p-trivial. Note that %- 
trivial regular languages coincide with star-free languages [13, Chapter 11] and 
£-trivial, 7?.-trivial and J^-trivial regular languages are all star-free languages. 
Moreover, j/-trivial regular languages are both £-trivial and 7?.-trivial. 

From the computer science viewpoint, a regular language is 7?.-trivial if and 
only if it is a finite union of languages of the form IJlaiU20,2S3 ■ ■ ■ ^IcikS* , 
where fc > 0, Oj G Z', and Si Q S \ {ui} , or if and only if it is accepted by 
a partially ordered minimal DFA [3 . Similarly, a regular language is ^J-trivial 
(also called piecewise testable) if and only if it is a finite boolean combination 
of languages of the form Z'*aiZ'*a2Z'* . . . Z:*akS*, where fc > and a^ G S, or 
if and only if the minimal DFAs for both the language and the reverse of the 
language are partially ordered [16I17J . Other automata representations of these 
languages can be found, e.g., in [8] and the literature therein. 

In 1985, Stern TS^ suggested a polynomial-time algorithm of order O(n^) in 
the number of states and transitions of the minimal DFA to decide whether a 
regular language is J'-trivial. Recently, Trahtman [TH] improved this result to 
a quadratic-time algorithm. Polak and Klima [12j have further investigated the 
properties of piecewise testable languages and introduced so-called biautomata 
that can be used to verify piecewise testability of a regular language, but which 
can be of exponential size in the worst case as shown in jllj . 

In |10 | , we have shown that the upper bound on the state complexity of the 
reverse of 7?.-trivial and J'-trivial regular languages is 2"^^ for languages of the 
state complexity n and that this bound can be met by a ternary 7?.-trivial regular 
language. Furthermore, we have conjectured that an (n— l)-element alphabet is 
sufficient for J'-trivial regular languages of the state complexity n to meet the 
upper bound. A proof can be found in "S]. 

In this paper, we prove the optimality of the alphabets. Namely, we prove 
that the bound on the state complexity of the reverse can be met neither by a 
binary 7^-trivial regular language nor by a j7-trivial regular language over an 
(n — 2)-element alphabet. We also show tight upper bounds for binary 7?.-trivial 
regular languages. As a result, we give a characterization of tight upper bounds 
for 7?.-trivial regular languages depending on the state complexity of the language 
and the size of the alphabet. Finally, we prove a tight upper bound for (n — 2)- 
element J^-tiivial regular languages and a few tight bounds for binary ^J-trivial 
regular languages. The case of (n — fc)-element J^-trivial regular languages, for 
2 < fc < n — 3, is left open. 



2 Preliminaries and Definitions 

We assume that the reader is familiar with automata and formal language the- 
ory [13]. The cardinality of a set A is denoted by \A\, and the powerset of A is 
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denoted by 2 . An alphabet is a finite nonempty set. The free monoid generated 
by an alphabet S is denoted hj S* . A string over S is any element of S* . The 
empty string is denoted by e. 

A nondeterministic finite automaton (NFA) is a 5-tuple M — (Q, S, 6, Qo, F), 
where Q is the finite nonempty set of states, U is the input alphabet, Qq C Q is 
the set of initial states, F C Q is the set of accepting states, and 5 : Qx S ^ 2^ 
is the transition function that can be extended to the domain 2^ x E* . The 
language accepted by M is the set L{M) = {w e E* \ S{Qa,w) n i^ 7^ 0}. The 
NFA M is deterministic (DFA) if \Qo\ = 1 and \S{q, a)\ = 1 for every q in Q and 
a in S. In this case we identify singleton sets with their elements and simply 
write q instead of {q}. Moreover, we consider the transition function 6 to he a 
total mapping from Q x S to Q that can be extended to the domain Q x S* . 
A non-accepting state d € Q such that d{d, a) — d, for all a in S, is called a 
dead state. Two states of a DFA are distinguishable if there exists a string w 
that is accepted from one of them and rejected from the other; otherwise they 
are equivalent. A DFA is minim,al if all its states are reachable and pairwise 
distinguishable. 

A DFA M = {Q, S, S, Qq, F) is partially ordered if the reachability relation 
:< on the set of states, defined by p ^ (7 if there exists a string w in S* such that 
S{q, w) — p, is a partial order. 

The subset automaton of an NFA M = {Q,S,6,Qo,F) is the DFA M' = 
(2'5j S, S' , Qo, F') constructed by the standard subset construction. 

The reverse w of a string w is defined by e = e and (va) = av , for v in 
E* and a in S. The reverse of a language L is the language L^ = {w^ \ w G L}. 
The reverse of a DFA M is the NFA M^ obtained from M by reversing all 
transitions and swapping the role of initial and accepting states. 

The following result says that there are no equivalent states in the subset 
automaton of the reverse of a minimal DFA. We use this fact in the paper when 
proving the tightness of upper bounds. By this fact, it is sufficient to show that 
the corresponding number of states is reachable in the subset automaton since 
the distinguishability always holds. 



Fact 1 ([2j) All states of the subset automaton corresponding to the reverse of See Appendix, p. 13 
a minimal DFA are pairwise distinguishable. D for a proof. 

In this paper we implicitly use the characterizations that a regular language 
is 7?.-trivial if and only if it is accepted by a minimal partially ordered DFA and 
that it is jT-trivial if and only if both the language and its reverse are accepted 
by minimal partially ordered DFAs. 

Note that this immediately implies that J'-trivial regular languages are closed 
under reverse. However, 7?,-trivial regular languages are not closed under reverse 
since the 7?,-trivial regular language of Fig. 1 in flO^ is not J'-trivial, hence the 
minimal DFA for the reverse is not partially ordered. 

The state complexity of a regular language L, denoted by sc(L), is the number 
of states in the minimal DFA recognizing the language L. 

The following lemma shows that in some cases we do not need to distinguish 
between DFAs with and without dead state. In particular, we can get the result 
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for DFAs without a dead state immediately from an anafogous result for DFAs 
with a dead state or vice versa|f] 

Lemma 1. Let L be a regular language. Then sc{L) = sc(L^), where L'^ denotes 
the complement of L. In particular, we have sc(i^) = sc((L'^)^). 

Proof. Let M be a minimal DFA accepting L. Then M'^ constructed from M by 
swapping accepting and non- accepting states is a minimal DFA accepting L*^. 
The second part now follows by the observation that {L^Y = {L'^)^. D 

This lemma says that if a DFA M has a dead state and reaches the upper 
bound, then the same result for DFAs without a dead state can be derived from 
the complement of M if it does not have a dead state. However, note that Tablejl] 
demonstrates that there are cases where this technique fails because both the 
DFA and its complement have a dead state. 

Immediate consequences of this lemma combined with the known results are 
formulated below. 

Corollary 1. (i) There exist ternary TZ-trivial regular languages Li and L2 
whose automata representation has and does not have a dead state, respectively, 
with sc{Li) = sc{L2) = n and sc(Lf ) ~ sc(L2^) = 2"^^. (ii) There exist J- 
trivial regular languages Li and L2 over an alphabet S with \I]\ > n — 1 whose 
automata representation has and does not have a dead state, respectively, with 
sc(Li) = sc(i2) = n and sc(Lf ) = sc(Lf ) = 2"-^. 

See Appendix, p. [T3j Proof. Using LemmalTl (i) follows from jTOl Lemma 3, p. 232] since the automa- 

for the figures. ton used there has a dead state and its complement does not, while (ii) follows 

from the automaton used in [SJ v2. Theorem 5, p. 15]. D 

3 7?^-trivial regular languages 

Recall that the state complexity of the reverse of 7?,-trivial regular languages 
with the state complexity n is 2""^ and there exists a ternary witness language 
meeting the upper bound. We now prove that the ternary alphabet is optimal, 
that is, the upper bound cannot be met by any binary 7?,-trivial regular language. 

Lemma 2. Let L be a binary TZ-trivial regular language with sc{L) = n, where 
n>2. Then sc(L^) < 2"-^ +n~l. 

Proof. Let M — ({1, . . . , n}, {a, 6}, 5, 1, F) be a partially ordered DFA with n 
states such that i di j implies i < j. Let M' be the subset automaton of the 
NFA M^. We show that M' has at most n — 1 reachable sets that contain n—1. 
By Lemma [1] we can assume that state n is accepting, otherwise we take the 
complement. Then there are two cases: (i) both symbols a, b go from state n—1 
to state n, or (ii) without loss of generality, the transition under b goes from 
n — 1 to n and there is a self-loop under a in state n — 1. 



We are grateful to an anonymous referee for pointing out this observation. 
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Fig. 1. There are three trees, namely T4, = {0, 1, 2, 3, 4}, Ts = {5, 6, 7, 8}, and Tg = {9}; 
6-transitions are dotted. 



In the first case, no sets without state n — 1 are reachable in M', except for F. 
Hence the upper bound is at most 2"^^^ + 1. 

In the second case, all sets not containing state n—1 must be reachable by 
strings in a* . We now prove that at most n — 1 such sets are reachable in M' . 
It is sufficient to prove that F ■ a"~^ = F ■ a"~^, where • is the transition function 
of the subset automaton M' . 

The subautomaton of M, defined by restricting to the alphabet {a}, is a 
disjoint union of trees Tq where 5{q^ a) — q and Tq consists of all states that can 
reach g by a string in a*; see Fig. [llfor illustration. Let k be the depth of Tq, 
and let F' = F C^Tq. 11 q (^ F', then F' ■ a'' ^ Tq. If fc ^ F', then F' ■ a'' = %. In 
both cases F' ■ a^ = F' ■ a'^^^. Now F ■ a™ is a disjoint union of such F' ■ a™. 
By the assumptions, all trees are of depth at most n — 2; recall that there is no 
transition under a from n — 1 to n. Hence F ■ a""^ — F ■ a"^^ follows. D 

The following lemma proves the lower bound 2"^^ on the state complexity 
of the reverse of binary 7?.-trivial regular languages. 

Lemma 3. For every n > 3, there exists a binary TZ-trivial regular language 
with sc(L) — n such that sc(L^) > 2"~^. 

Proof. Consider the language L accepted by the partially ordered binary n-state 
DFA Af shown in Fig. [2J We now show that each subset of {0, 1, . . . , n — 2} is 
reachable in the subset automaton of the NFA M^ . 

The proof is by induction on the size of subsets. The subset {0} is the initial 
state of the subset automaton. Each subset {0, ii, 12, . . . , ifc} of size fc + 1, where 
1 < ii < ^2 < • • • < ifc < 71 — 1, is reached from the subset {0, 12 — Ji, . . . , Jfe — ii} 
of size k by the string ab^^^^. D 

Using a computer program we have computed a few tight bounds summarized 
in Table n] Note that the upper bound 2"~^ + (n — 1) is met by a DFA for L 
if n < 6, but not if n = 7. In addition, more than 2"^^ states are reachable if 
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dead state 
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2"-2+n-l 


2n-2 


Witness 
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1/2 


1/2 
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L2 = a*6(a + 6)* 


3 


4 
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4 
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L3 = 6* + h*aL2 


4 


7 


7 


7 


4 


L4 = h* aLz 


5 


12 


12 


12 


8 


L5 = h*a{aL-i + feL2) 


6 


21 


21 


21 


16 


L6 = 6*a(6*a + L5) 


7 


34 


34 


38 


32 


h*ab*a{a + b){e -\- aLs + 6L2) 


8 


55 


64 


71 


64 





Table 1. The tight bounds for the reverse of binary 7?.-trivial regular languages. The 
witness languages are e7-trivial. 



n < 7, but not if n = 8. By Lemma [T] this means that for n = 8, the worst-case 
minimal partially ordered DFA has a dead state and so does its complement. 

It is worth mentioning that the witness languages are even jT-trivial, hence 
these tight upper bounds also apply to binary J'-trivial regular languages dis- 
cussed below. 

We now prove that if n > 8, then the upper bound is 2"^^ in the case of 
binary alphabets. 



Lemma 4. Let n > 8 

sc(L) = n. Then sc(i^) 



and let L he a binary TZ-trivial regular language with 
< 2"^^, and the bound is tight. 



Proof. Consider a minimal partially ordered n-state DFA M over the alphabet 
{a,b}. If there are two maximal states, then one of them is accepting and the 
other is the dead state. Then the accepting state appears in all reachable subsets 
of the subset automaton of the NFA M^, while the dead state appears in no 
reachable subset. Hence the upper bound is 2"~^. If M has only one maximal 
state, and the state is the dead state, we take the complement that has the same 
state complexity by Lemma [T] and does not have the dead state. 

Thus, it remains to investigate the upper bound for minimal partially ordered 
DFAs with no dead state. Assume that n is the maximal state. Note that if a 
minimal binary partially ordered DFA has at least four states, there is a path of 




Fig. 2. A binary 7?.-trivial regular language meeting the bound 2" ^ for the reverse. 
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Fig. 3. Path of length two, where x £ {a, b}. 



length two in the automaton. Consider three last states of such a longest path, 
say n — 2 — > n — 1 — > n. In particular, there is no longer path from n — 2 to n. 
Note also that n — 1 is not accepting, otherwise it is equivalent to n. As above in 
Theorem [21 we can show that to reach the upper bound, the situation must be as 
depicted in Fig.|3] Indeed, if both a, b go from n — 2 to ?i — 1, then there is at most 
one subset containing n and n — 1 and not n — 2, namely F ■ b. Now, compute the 
number of reachable subsets containing n and n ~ 1, but not containing n — 2. 
Recall that F ■ a^ , k > 0, reaches at most n — 1 different subsets Xi, . . . , X„_i. 

If a; = a, then we have the following possibilities: (i) b goes to n; (ii) b goes 
to a state p from which one transition goes to n and one is a self-loop; (iii) b is 
a self-loop in n — 2. 

In the first case, there is no such subset because n — 1 is introduced by 6, 
which also introduces n — 2. 

In the second case. Up goes to n under a, then 6 is a self-loop in p, and there 
are at most {n — 2) subsets, F -b-b^ with fc > 0, containing n and n — 1 and not 
containing n — 2. If p goes to n under 6, then a is a self-loop in p, and there are 
at most (n — 1) subsets, F ■ a'^ ■ b with fc > 0, containing n and n — 1 and not 
containing n — 2. 

In the third case, all the reachable subsets are F ■ a^ ■ b ■ h^ with k,i > 
0. As there are at most n — 2 subsets F ■ b ■ b^, Yi, . . . , F„_2j the number of 
these subsets corresponds to the number of pairs {{Xi,Yj) \ i > j}, which is 
{{n - l){n -2) + {n- 2))/2 = n{n - 2)/2. 

Ii X = b, then we have the following possibilities: (i) a goes to n; (ii) a goes 
to another state p from which one transition goes to n and one is a self-loop; 
(iii) a is a self-loop in n — 2. 

In the first case, there is only one subset containing n and n — 1 and not 
containing n — 2, namely F ■ b. 

In the second case, ii p goes to n under a, then 6 is a self-loop in p and there 
are at most three subsets, F ■ b, F ■ ba and F ■ ab, containing n and n—1 and 
not containing n — 2. Ii p goes to n under b, then a is a self-loop in p and there 
are at most (n — 1) subsets, F ■ a ■ b with fc > 0, containing n and n—1 and not 
containing n — 2. 

In the third case, all these subsets are reachable only by strings containing 
one b, that is, the subsets are reachable by F ■ a^ ■ b ■ a^ . Their number is at 
most the number of pairs {{Xi,Zj) \ i > j}, where F ■ b ■ a^ reaches subsets 
Zi, . . . , Zn-2, which is n{n — 2)/2. 

Summarized, if n > 4, then the subset automaton of the NFA M^ has at 
most 2""'^ -I- min C^^"" ,2""'^) -f (n — 1) reachable states. Note that we get at 
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least 2"^-^ subsets only in the case of 4 < n < 7. Hence if n > 8, then the upper 
bound is reachable by automata with two maximal states, which means that the 
upper bound is 2"^^ if n > 8. D 

Denote by fk {n) the state complexity function of the reverse on binary 7?,- 
trivial regular languages over a fc-letter alphabet defined by 

fkin) = max{sc(L^) | L C Z"*, \S\ ~ k,L \s 7?.-trivial regular, and sc(i) — n\. 

Using this notation, we can summarize our results in the following theorem. 

Theorem 2. Let n> 1 and let fk{n) he the state complexity of the reverse on 
TZ-trivial regular languages over a k-letter alphabet. Then 

fi{n) = n, 

1, ifn^l, 

, , , . 2"-2 + n-l, if2<n<6, 
^^(")=^34, ^fn^7r 

2"~^, otherwise, 

fi{n) = fk{n) — 2"^ for every k with fc > 3. 

Proof. Since the reverse of every unary language is the same language, we have 
/i(n) = n. The upper bounds on /2 are given by Lemmas ^ and H and by our 
calculations in the case of n = 7. The lower bounds in the case of 1 < n < 7 also 
follow from the calculations, while the case of n > 8 is covered by Lemma [3| The 
result for f^ is from CoroUarylT] Since adding new symbols to the ternary witness 
automata does not change the proofs of reachability and distinguishability in the 
ternary case, the upper bound is tight for every k with fc > 3. D 

4 jT-trivial regular languages 

As any j/-trivial regular language is 7?.-trivial, the upper bound for 7?,-trivial 
regular languages applies. To prove the results of this section, we first define 
Simon's condition on 7?,-trivial regular languages to be ^J-trivial. 

Any DFA M = {Q,S,S,qQ,F) can be turned into a directed graph G{M) 
with the set of vertices Q, where a pair {p,q) € Q x Q is an edge in G{M) if 
there is a transition from p to g in M . For F C S^ we define the directed graph 
G{M, r) with the set of vertices Q by considering only those transitions that 
correspond to letters in 7^. For two states p and q of M, we write p < q ii p ^ q 
and p =/= q. A state p is maximal if there is no state q such that p ^ q. 

For a directed graph G = {V, E) and p E V, the set 

C(p) = {q eV \ q = p or there is a directed path from p to q} 
is called the component of p. 
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Definition 1 (Simon's condition). A DFA M with an input alphabet S sat- 
isfies Simon's condition if, for every subset F of S , each component of G{AI, F) 
has a unique maximal state. 

Simon [T7] has shown the following result. 

Fact 3 An TZ-trivial regular language is J'-trivial if and only if its minimal 
partially ordered DFA satisfies Simon's condition. 

It is worth mentioning that it is more efficient to use Trahtman's condition 
to decide whether an 7?.-trivial regular language is j7-trivial. The improvement 
Trahtman has shown is the following: for a state p, let S{p) denote the set of 
symbols under which there is a self-loop in p. Then an 7?,-trivial regular language 
is J'-trivial if and only if its minimal partially ordered DFA satisfies that, for 
every state p, the component of G{M, S(jj)) containing p has a unique maximal 
state, see [19] for more details. 

Concerning the lower bound state complexity, note that it was shown in [B] 
that there are finite languages over a binary alphabet whose reverse have a blow- 
up of 3 • 2?^^ — 1, for even n, and of 2t^ — 1, for odd n. As every finite language 
is j/-trivial, we obtain at least these lower upper bounds for binary j7-trivial 
regular languages. 

Using Simon's result we immediately obtain the following lemma. 

Lemma 5. Let F Q S. If a partially ordered DFA M over S satisfies Simon's 
condition, then the DFA M' (not necessarily connected) obtained from M by 
removing transitions under symbols from F also satisfies Simon's condition. 

Proof. Let S' — S \ F . By Fact l3J each component of G{M, S') has a unique 
maximal state and remains partially ordered. n 

We now prove the main result of this section. 

Theorem 4. At least n—1 symbols are necessary for a J'-trivial regular language 
of the state complexity n to reach the bound 2"~^ on the number of states in the 



Proof. We prove by induction on jQI that for every partially ordered DFA Af = 
(<5, E, S, qo, F), where \E\ < \Q\ — 1, satisfying Simon's condition, the number of 
sets reachable in the subset automaton of the NFA M^ is strictly less than 2l'3l-i. 
The basis, \Q\ = 3, holds since then the automaton is over a unary alphabet, 
which means that L{M^) = L{M), hence sc{L{M^)) ^ sc(L(M)) < 3 < 2^. 

Assume that, for some k > 3, the claim holds for every partially ordered DFA 
satisfying Simon's condition with at most k states. Let M be a partially ordered 
DFA satisfying Simon's condition with n = k-\-l states, and assume that 21'^'"-'^ 
subsets are reachable in the subset automaton of the NFA M^. Without loss of 
generality, we may assume that the maximal state of M is denoted by n, and 
that it is accepting. 
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Fig. 4. The partially ordered DFA M'; F = EpU Eq 



Let R denote the set all states different from n with a transition to n, that is, 
R = {q £ Q\{n} I 6{q, a) = n,a £ E}. Let F denote the set of symbols connect- 
ing the states of R with the state n, that is, F = {a e S \ S{R, a) n {n} ^ 0}. 
Let M' be the A:-state subautomaton of M obtained by removing the state n 
and all transitions labeled by symbols from 7^. In particular, the automaton M' 
satisfies Simon's condition by Lemma [sj Let max(i?) be the set of all states 
of R that are maximal in M' . Note that every state of max(i?,) must be non- 
accepting, otherwise it appears in all reachable subsets of the subset automaton 
of M^ because of self-loops or transitions from n. 

For a state p in max(i?), let Cp denote the component of p in G{M'), and 
let Up — {a & S \ S{p, a) = n} C F denote the set of all labels under which p is 
connected to n, see Fig. Ill for illustration. Note that if Cp and Cq are connected, 
and p j^ q, then p and q are two maximal states of the component containing 
CpU Cq, which is a contradiction because M' satisfies Simon's condition. 

We now prove that for every state p in max(i?), there exists a letter in Up 
which does not appear in Eq for any other state q in max(i?). For the sake of 
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contradiction, assume that there is a state p in max(i?) with Sp C[J 
Since all the subsets containing n and p and not containing any q different 
from p are reachable, and all states of max(i?) are non-accepting, the state p is 
introduced to the subset from n by a transition under a letter from Up which also 
introduces a state q into that subset. Since p and q are maximal, they cannot be 
later eliminated from the subset. 

Thus, we have shown that for each state p in max(i?), there exists a letter ap 
in Up that does not appear in Sg for any state q in max(i?) with q ^ p. Let F' = 
{(jp I p € max(i?)} denote the set of these letters. Then \F'\ = \ max(i?)|. 

Construct a new partially ordered DFA M" from M' by joining all states 
of max(i?) into one state. As they are all non-accepting, the subset automaton 
of the reverse of M" has the same number of reachable states as the subset 
automaton of the reverse of M' . Moreover, as all sets that contain n and do not 
contain anything from max(i?) are reachable in the subset automaton of M^, the 
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ai,...,a„_4 ai,...,a„_3 

n- 1 




Fig. 5. The witness minimal partially ordered DFA M satisfying Simon's condition. 



automaton M" reaches the upper bound on the reverse. Thus, if \S\ < \Q\ — 1, 
then we have in M" that \S"\ = |i:|- | niax(i?)| < |Q|- | max(i?)|-l = \Q"\-l. 
But, by the induction hypothesis, the automaton M" cannot reach the upper 
bound on the reverse, which is a contradiction. D 

Using this result, we can prove the tight upper bound on the state complexity 
of the reverse for j7-trivial regular languages over an (n — 2)-letter alphabet. 

Theorem 5. Let n > 3 and let L be a J^ -trivial regular language over an (n— 2)- 
letter alphabet with sc{L) = n. Then sc(L^) < 2"~^ — 1, and the bound is tight. 

Proof. The upper bound follows from Theorem [4] To prove tightness, consider 
the J'-trivial regular language accepted by the minimal DFA M = ({0, 1, . . . , n — 
1}, {ai, . . . , an-2}, S, 1, {0}) depicted in Fig. p\ The transitions under a symbol 
Uj in S are defined by 

{i + I, if i < j < n - 2, 
0, if i=j -I, 
i, otherwise. 

In the subset automaton of the NFA AI^, the initial state is state {0} and, 
if 1 < fc < n — 2, then every {k + l)-element set {0, ii, 12, . . . , ik} with 2 < ii < 
i2 <■■■< ik ^ n— lis reached from the /c-element set {0, 12, ... , ik} by a^^ . This 
gives 2"~^ reachable states. Note that the subset {0, 1} is not reachable, but all 
subsets of cardinality at least three containing and 1 are reachable since every 
set {0, l,ii, . . . ,ife} is reached from the set {0, 2,^2, . . . ,ife} by the symbol a^j. 
This completes the proof. D 

By Theorem [4j the upper bound for binary j7-trivial regular languages is 



at most 2" '^ + min( 



i.in-2) r,n-3 



2" '^) + (n — 1). Moreover, the witness languages 



presented in Table [T] are j7-trivial. Note that if n > 8, we need a dead state 
to reach the upper bound 2"~^. Hence it is an open problem what is the tight 
upper bound for binary j/-trivial regular languages in the case of n > 8. 

Corollary 2. Let L be a binary J'-trivial regular language with sc(L) = n, 
where n > 4, then sc(L^) < 2"-^ + min( "<"^''^^ 2"-^) + {n - 1). A few tight 
upper bounds for small n are given in Table [7| 
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5 Conclusions 

We have characterized tight upper bounds on the state complexity of the reverse 
of 7?,-trivial regular languages depending not only on the state complexity of the 
language, but also on the size of the alphabet. This characterization also gives 
upper bounds for jT-trivial regular languages. However, we have shown that they 
are not reachable for ^J-trivial regular languages of the state complexity n over an 
(n — fc)-clemcnt alphabet, where 2 < fc < n — 3. Namely, we have shown the tight 
upper bounds for (n — 1)- and (n — 2)-element ^J-trivial regular languages, but 
(except for a few examples for binary ^J-trivial regular languages) the problem 
what are the tight upper bounds for ^J-trivial regular languages over an alphabet 
of a lower cardinality is open. 

Acknowledgements. The authors gratefully acknowledge very useful sugges- 
tions and comments of anonymous referees on the previous version of this work. 
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Appendix 

Fact [l] All states of the subset automaton corresponding to the reverse of a 
minimal DFA M are pairwise distinguishable. 

Proof. Let g be a state of the NFA M^. Since q is reachable in M, there exists 
a string Wq which is accepted by M^ from the state q. Moreover, the string 
Wq cannot be accepted from any other state of M^ because M is deterministic. 
Next, two distinct subsets of the subset automaton differ in a state q of M^, 
and the string Wq distinguishes these two subsets. D 



Figures from Corollary [T] 



a, b, c ^ — -- 

ao ) — - — *{ 91 



a, b 
a,b,c ^ -^^ a,c ^"^"^ 




Fig. 6. Figure of [lOl Lemma 3, p. 232]. 
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fli , . . . , a„_3 oi , . . . , a„_2 




Fig. 7. Figure of [5, v2, Tlieorem 5, p. 15] 



